Unfolding and unzipping of single-stranded DNA by stretching 
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We present a theoretical study of single-stranded DNA under stretching. Within the proposed 
framework, the effects of basepairing on the mechanical response of the molecule can be studied 
in combination with an arbitrary underlying model of chain elasticity. In a generic case, we show 
that the stretching curve of ssDNA exhibits two distinct features: the second-order "unfolding" 
phase transition, and a sharp crossover, reminiscent of the first-order "unzipping" transition in 
dsDNA. We apply the theory to the particular cases of Worm-like Chain (WLC) and Freely- Joint 
Chain (FJC) models, and discuss the universal and model-dependent features of the mechanical 
response of ssDNA. In particular, we show that variation of the width of the unzipping crossover 
with interaction strength is very sensitive to the energetics of hairpin loops. This opens a new way 
of testing the elastic properties of ssDNA. 

PACS numbers: 87.14.Gg, 82.37.Rs, 64.90.-Fb 
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I. INTRODUCTION 



A dramatic progress has been made over last decade in 
employing the single-molecule micromanipulation tech- 
niques for studies of biological materials and processes. 
Pioneered by the work of Smith et. al. [1] on stretch- 
ing of double-stranded DNA (dsDNA) , these techniques 
have later been applied to study proteins, DNA-protein 
interactions, chromosomes etc. The chain-stretching ex- 
periments were also performed on single-stranded DNA 
molecules (ssDNA)[2]-[5]. 

One could expect the response of ssDNA to stretch- 
ing to be dramatically different from that of dsDNA, be- 
cause of the effects of the possible basepairing between 
complementary segments of the chain. The need for un- 
derstanding of the resulting mechanical behavior has al- 
ready attracted a considerable attention of theorists [6]- 
[10]. In particular, it has been shown [7]- [9] that in 
the thermodynamic limit, ssDNA chain should undergo 
the second-order phase transition, at finite critical force. 
Formally, this phenomenon is very similar to the hypo- 
thetical native-molten transition in RNA [11], as well as 
to the classical (yet unconfirmed) picture of dsDNA de- 
naturation [12]. 

The predicted critical behavior is qualitatively differ- 
ent from another related phenomenon, the force-induced 
denaturation (unzipping) of dsDNA [13]-[14]. The un- 
zipping is a first-order transition, which occurs as a re- 
sult of competition between the elastic energy of the 
stretched ssDNA, and the basepairing (hybridization) en- 
ergy within dsDNA. The very same effects are essential 
for the above second-order phase transition in stretched 
ssDNA. As a part of the present work, we will clarify the 
relationship between the two phenomena. 

The theoretical modelling of ssDNA and RNA is tradi- 
tionally done within Freely Joint Chain (FJC) model. Its 
extensible version has been originally used for fitting of 
the early ssDNA stretching data [2]. However, both the 
microscopic structure of ssDNA, and the recent experi- 
ments with DNA hairpin constructs [15]-[16] , strongly 



suggest the picture with a finite bending modulus of its 
backbone, which feature is reminiscent of the semiflexible 
Worm-Like Chain (WLC) model [17]. Nevertheless, be- 
ing a continuous model, the WLC description is unlikely 
to be valid in the regimes when discrete nature of chem- 
ical bonds becomes relevant (e.g. for a sufficiently high 
stretching force). To overcome this limitation. Discreet 
Persistent Chain (DPC) model has been proposed for ss- 
DNA in recent work by Storm and Nelson [18]. Inter- 
estingly, WLC itself was originally introduced by Kratky 
and Porod [17] as a continuous limit of a similar dis- 
creetized model. The authors of [18] have shown, that 
the extensible versions of all three models (FJC, WLC, 
and DPC) produce a good fit to the experimentally ob- 
served stretching curves, as long as force is not too high 
(/ < 200pN for FJC, and / < iOOpN for WLC/DPC). 

The central idea of our paper is to include the effects 
of basepairing within a theoretical framework compatible 
with an arbitrary underlying model of ssDNA elasticity. 
In this way we can separate the two parts of the problem: 
the search of an adequate elastic description of ssDNA, 
and the evaluation of the effects of its self-interactions. 
In fact, one may be able to start with an empirical elastic 
models extracted from independent experiments, and use 
our theory to predict the stretching behavior of the chain 
after "switching on" the basepairing interactions. 



II. THEORETICAL FRAMEWORK 

We consider ssDNA chain subjected to external pulling 
force /. As an input for our theory, one has to spec- 
ify two functions characterizing the system without self- 
interactions: Qeiif) - elastic free energy per unit chain 
length vs. pulling force; and FioopiJ) ~ free energy of a 
loop as a function of its contour length. Our goal is to 
study the effect of interactions between complementary 
segments of the chain. Similarly to ref [7] , we assume the 
interaction strength to be the same for any two chain 
fragments. It will be characterized by a single param- 
eter, e, defined as pairing energy per unit chain length. 
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Strictly speaking, this should limit applicability of our 
approach to self-complementary periodic sequences of ss- 
DNA (such as ATATAT...). However, as discussed in [7]- 
[8], this uniform model may be reasonably adequate for 
random sequences, too. Nevertheless, the effect of ran- 
domness remains an important problem for the future 
studies. 

Hybridization of distant chain segments results in loop- 
ing. Therefore, the interactions reduce the effective (free) 
chain length I exposed to the stretching force. Our 
goal is to calculate the partition function of the system 
Z{L,l), parameterized by the total chain length L, and 
free length I. The general form of the interaction-free 
partition function is, Zo{L,l) — exp (/ioi) (5 (L — /), be- 
cause / and L coincide, and different chain segments are 
statistically independent (we neglect excluded volume ef- 
fects). Without loss of generality, one can choose iiq — 0. 
It is useful to perform double Laplace transform of parti- 
tion function Z{L,l). This results in introduction of pa- 
rameters jj and q, conjugated to L and / respectively (pa- 
rameter ^ conjugated to the total length, is often called 
fugacity): 



(a) 



Z{ii,q) 



Z{LJ)e^p[~fiL~ql]dLdl. (1) 



In particular, ZQ{fi, q) = 1/ {fi + q). Since the elastic part 
of free energy is qei{f)l, the overall free energy may be 
expressed as 



F(L,/) = -log / Z(M,geK/))e^^;^. 



(2) 



Here and below, we take ksT = 1. 

It is well known that the partition function of a uni- 
form RNA or ssDNA may be calculated in a recursive 
manner, reminiscent of Hartree approximation [6]- [11]. 
This calculation can be represented in a diagrammatic 
form shown in Figure 1. The solid lines correspond to 
bare partition function Zq; dotted fragments connected 
with dashed lines represent pairing (hybridization) of the 
corresponding chain segments. Within our approach, the 
energy cost of the pairing is eLhyb + Fioop{l), where Lhyb 
is the length of the hybridized region, and I is the free 
length of the chain segment being "internalized" due to 
the looping. In fact, one can extend our approach to a 
more general form of hybridization energy, Eq + ^Lhyb, 
with constant eo > representing the energy cost of ter- 
mination of the double-stranded segment. Introduction 
of £o accounts for the cooperativity of the basepairing 
interactions. 

Traditionally, problems involving DNA/RNA folding 
or denaturation, are studied within a discreet model. 
Each discrete "monomer" in this approach represents a 
chain segment which can hybridize independently of its 
neighbors. Its length is assumed to be equal to the sta- 
tistical segment, which makes it easy to combine this 
description with FJC model. Since we are interested in 




(c) Dyson Equation: 
— = + 



_' ' ' 



L" Z(L',I) 



FIG. 1: Schematic representation of partition function calcu- 
lation. Thin solid lines represent Zq, while dashed line indi- 
cate base-pairing of distant chain segments (dotted). 



developing a theory for an arbitrary model of chain elas- 
ticity, it is logical to abandon this artificial discretization 
of hybridized segments. Of course, the basepairing re- 
mains fundamentally discreet on the length scale of a 
single base. As long as the relevant physics occurs on 
larger scales, ssDNA can be considered as a continuous 
chain. 

The crucial observation is that one can typically ne- 
glect all the diagrams with intersecting dashed lines. 
Such situation would correspond to "pseudo-knot", 
whose probability is low because it requires winding of ss- 
DNA/RNA around itself (Fig. lb). Thanks to this topo- 
logical constrain, "self energy" diagram entering Dyson 
equation (Fig Ic), may be calculated exactly within one- 
loop (Hartree) approximation. Since we have assumed 
the uniform interaction parameter, the problem has a 
particularly simple form at the Laplace representation: 



Z ^in,q) = Z^ ^Ui,q) - 







X exp [-fi {2L" + L') - (L" - Fioop{l) - eo] ■ 



From here, one can obtain: 



W{q^ 



2n + e' 

were corresponds to the pole in Z{fi,q), and 



(4) 



W{q) 



exp [-Fioopil) + ql] dl. 



(5) 



III. 



GENERIC BEHAVIOR: "UNZIPPING" 
VERSUS "UNFOLDING" 



3 



In the thermodynamic hmit {L — > oo), 



+100 



dq 



log 



Qelif) 



(6) 

Here function /x, (q) again corresponds to the pole in 
Z{iJ,,q), i.e. it is inverse to (yu). Explicitly, 



(7) 



This yields the following result for the free length as a 
function of tension: 



l_ 
L 



qei-e/2 + W'iqei) 
^{qei~e/2f +2W{qei), 



(8) 



Here W' = dW/dq. Note that from the definition of func- 
tion W, Eq. (5), one can relate its logarithmic derivative 
to the average length of the loop: 



W 

hoopio) = 



(9) 



If I is known, one can easily find the relative elongation 
of the chain, i.e. the ratio of the cnd-to-cnd distance R 
to the total chain length L, which is observable experi- 
mentally: 



R _ , _ 1 dqei 
i~ 2df 



qei-e/2 + W' {q,i) 



\/i'lei-e/2f 



2W{qei)^ 



(10) 

Hero X = —dq,,i/df is the relative elongation of the free 
portion of the chain. 

According to Eq. (8), in the strong stretching limit 
{qei — cc) the chain becomes completely "free": I L. 
Further examination of our expression for the free length 
reveals its peculiar behavior near the point qei = e/2. In 
fact, in the limit of vanishing W (i.e. very high energy 
cost of a loop), I (qei) becomes a step function changing 
from to i at that point. In a realistic situation, it is 
transformed into a crossover whose width depends on W, 
as 



6g = y/2W {e/2). 



(11) 



Since the location of that crossover corresponds to the 
point were the hybridization free energy is exactly equal 
to the elastic one, we conclude that this behavior is di- 
rectly related to the first-order unzipping transition of 
dsDNA. The transition is transformed into the crossover 
due to the finite size of the hybridized segments. 

Another important feature of our result is that the 
free fraction of the chain l/L goes to zero at finite tension 



qei = q* ■ According to Eq. (8) the condition for this to 
happen is. 



q* - e/2 = 



W{q*) W {q*) 



W {q*) 
1 

hoop{q*) 



2 

koopjq*) 

'0 



(12) 



W{q*) 



This point corresponds to the second order phase transi- 
tion, which has been reported in the earlier studies of the 
problem. The crucial observation is that this transition is 
physically distinct from the unzipping crossover. It may 
be viewed as a precursor of the force-induced denatura- 
tion. Indeed, even when the tension in the free segments 
is still insufficient to overcome the binding energy — e, 
one can convert a finite fraction of the chain into the free 
length without breaking any bonds. Namely, since there 
is a finite density of loops at the folded ssDNA, one can 
simply "move" the unbound bases from the loops to the 
free segment. This will only result in an entropy loss, 
but will not reduce the interaction energy. 

Indeed, if the basepairing energy is negative, the mag- 
nitude of the critical tension is somewhat lower {\q*\ < 
—e/2) than that at the unzipping point: 



koop{q*) 



(13) 



The free segments of the chain exposed to the external 
stretching can be viewed as topologically distinct objects 
("vortices"), which separates the system onto indepen- 
dently folded domains. The effective interaction poten- 
tial of two consequent vortices is determined by the free 
energy of the folded region between them. At the point 
q* , the typical separation between the vortices along the 
chain coordinate, i.e. the size of the separable domains, 
diverges. This folding-unfolding transition occurs as a re- 
sult of competition between the elastic tension and fold- 
ing entropy, which grows with the domain size. 

Interestingly, Eq. (12) has physical solutions even for 
positive interaction energy, i.e. when the basepairing is 
energetically unfavorable. It follows form that fact that 
for lioop — > 00, the generic behavior of looping free en- 
ergy is FioopH) — 3/21ogZ, (for an ideal Gaussian chain). 
As a result, W (q) remains finite for q — > , while W 
and hoopiq*) are diverging, as q'^~'^. Therefore, according 
to Eq. (12), there is a finite tension at which the unfold- 
ing transition takes place, for e > 0. The asymptotic 
relationship between e and q* in this regime is, 



hoopiq*) 



fwio) 



(14) 



This yields a power law dependence of the critical tension 
on e: q* ~ e~^. Note however that our conclusion about 

the unfolding transition at positive values of basepairing 
energy, may well be a result of limitations of the model. 
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IV. APPLICATION TO WORM-LIKE CHAIN 
MODEL 

As an example, we discuss the application of the above 
approach to the particular case of WLC model. Its 
Hamiltonian is given by 



H 



ds 



(15) 



were Ip is persistence length, and r (s) defines the spa- 
tial conformation of the chain, subjected to constrain 
\dr/ds\ — 1. For this model, Marko and Siggia [17] have 
proposed the following interpolative relationship between 
the stretching force / and relative extension x = R/L. 





1 



(1 



- 1 



(16) FIG. 2: Looping probability, exp {—Fioop), as a function of 



From here, function Qeiif) (which is one of the inputs for 
our theory), can be expressed in parametric form: 



qeiif)= J fix')dx'~xf{x)^~ — 

^ 



1 



{i-xY 



(17) 

One has also to specify the loop free energy. An ana- 
lytic interpolation for Fioop was proposed in Ref. [19], in 
the context of the ring cyclization problem: 



vo 



.= (2^)%xp(-24i. + J-).i 



lib. 

4 / 



5P 



(18) 

Here vq < lA^ is the effective "reaction volume" asso- 
ciated with the localization of the loop ends by the hy- 
drogen bonding. We have found a simpler version of the 
interpolative expression, which gives a very good fit to 
the global behavior of the looping probability, e~^'°°'' 
(see Figure 2): 



nip {I — Ip) 



3/2 



4.5 



(19) 

Here is the step function. 

Within this approximation, function Wwlc (?) can be 
found analytically: 



W{q) ~ -^exp -3^-2qlp 
ip 



Here 



V6 



TT /3 



(20) 



(21) 



Note that a replaces three parameters of the original 
model, Vq, Iq, and energy Eq. Since S < Iq, and Eq > 0, 



loop size for different models of ssDNA elasticity. Here the 
solid line represents the analytic result for WLC, Eq. (19), 
and points correspond to FJC model. Insert: comparison of 
classical interpolative result for looping probability (from Ref. 
[19], dashed line), with the simplified formula, Eq. (19). 



this dimensionless parameter is expected to be small. It 
has a physical meaning of the effective reaction volume 
of the loop ends, in units of Iq. 

Now that we have specified Wwlc (?) and q{f), Eqs. 
(6)-(7), can be used to find /(/), together with force- 
extension curves, R{f) — l{f)x{f). The fact that a is 
a small parameter allows us to simplify the results. In 
the regime of negative e, a sharp unzipping crossover is 
expected. As we have discussed in the previous section, 
its characteristic width is given by 



2a 



It, 



■ exp 



(22) 

In the vicinity of the crossover point, q^i = e/2, the 
change in free length can be well described by the uni- 
versal fmiction of rescaled variable A = (g — e/2) /5q: 



L~2V' VA^ 



1 



(23) 



For large enough A, this universal behavior breaks down 
due to the proximity of the unfolding phase transition. 
The corresponding critical tension q* is given by: 



1 



^WLC — 2 



1 



/p(l + 3/V^) 



(24) 



Figure 3 shows the sharp crossover at |g(/)| — e/2. 
Consistently with Eq. (22), its sharpness increases with 
lowering parameter a. This can be associated with 
growth of the typical length of hybridized regions. The 
unzipping is clearly separated from second-order un- 
folding transition. While this crossover regime was not 
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FIG. 3: Theoretical stretching curves for WLC model with 
self-interaction. Dotted line corresponds to interaction-free 
WLC. 



present at the early studies of the problem, it has been 
recently reported in Ref. [9]. In that work, the tra- 
ditional FJC-based model has been modified to include 
the effects of cooperativity, analogous to our parameter 
Eq. As we have discussed, the sharp crossover should be 
interpreted as unzipping (force induced denaturation) , 
while the second-order transition corresponds to topo- 
logical change (unfolding) which may be viewed as a pre- 
cursor of the unzipping. This physical picture is consis- 
tent with the results of Ref. [9]. On the other hand, our 
analysis disagrees with the conclusions of Ref. [10], in 
which the first order phase transition was predicted for 
the regime of strong enough hybridization energy. 



V. MODEL DEPENDENCE OF THE RESULTS 

Here we discuss how the above results may depend on 
the choice of the elastic description of ssDNA. It should 
be noted that such important features, as the unfolding 
transition and the unzipping crossover, are very robust 
and nearly independent of the model. Furthermore, it is 
well known the stretching curve of ssDNA may be fitted 
reasonably well by several models, e.g. extensible ver- 
sions of FJC, WLC, or DPC. This imphes that deduction 
of Qei (/) from the existing and future experimental data 
is unlikely to provide a sensitive test for the possible mod- 
els. On the other hand, we have seen that the loop free 
energy Fioop (l) and consequently W (q) are significant 
parameters of the problem. As we shall see, this param- 
eters are very sensitive to the choice of the underlying 
elastic model. 

In the case of Discreet Persistent Chain (DPC) model, 
we do not expect any significant deviation of -Fioop (0 '^^ 
W (q) from those obtained for WLC model, since the typ- 



ical bending radius significantly exceeds the bond length. 
The (extensible) FJC model has been a standard frame- 
work in which the discussed problem has been studied 
so far. Nevertheless, here we briefly review the results 
of our theory for FJC, in order to identify the major 
model-dependent features. The freely joint chain consists 
of discreet bonds of length a = 2lp, whose orientations 
are mutually independent. The corresponding loop free 
energy can be written as. 



exp [-Fioop[l)] 



3/2 



,3/2 



(25) 



The prefactor here ensures that the asymptotic behavior 
of the free energy at the large-loops limit coincides with 
the WLC result, given by Eq. (18). Now, one can find 
W (g) for FJC model: 



W{q) 



3^ 



exp {2qlpn) 

„3/2 



(26) 



For negative e, the change of the elastic description 
results in a modest shift of the critical tension q* at which 
the unfolding transition occurs: 



1 



Ifjc — 2 



hoop{(-/2) 



o + TTT (27) 



This should be compared to WLC result, Eq. (24). 

While the position of the unzipping point, qei — e/2, 
is model independent, the width of the crossover is very 
sensitive to the behavior of W (<?) . As one can see in 
Figure 4, the shapes of W (q) curves are substantially 
different for WLC and FJC models. This difference ar- 
rises because of the relative suppression of the short loops 
in WLC case. What is especially remarkable is that the 
width of the unzipping crossover, 6q = \/2W (e/2) de- 
creases with e in a strongly model dependent manner. 
In fact, upon change of elp from (dsDNA denaturation 
point) to 3kT (which roughly corresponds to physiologi- 
cal conditions for -GCGCGC- sequence), the ratio of 6q 
for the two models changes by factor of ~ 3.5. There- 
fore, an experimental study of the unzipping crossover 
at variable conditions (e.g. temperature), would open a 
new possibility of testing the plausible models of ssDNA 
elasticity. 



VI. CONCLUSIONS 

In this paper, we have discussed the effects of basepair- 
ing on the stretching behavior of ssDNA, within a theo- 
retical framework compatible with an arbitrary underly- 
ing model of chain elasticity. Our conclusion is that in 
a generic case, the stretching curves exhibits two related 
but distinct feature: the second-order unfolding phase 
transition, and the sharp unzipping crossover. The lat- 
ter is reminiscent of the first-order transition in dsDNA, 
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FIG. 4; a) Comparison of W{q) functions calculated for 
WLC and FJC models. Insert: relative width 5q of unzip- 
ping crossovers, for FJC and WLC, as a function of base- 
pairing energy, b) Unzipping behavior of WLC (solid) and 
FJC (dashed) systems for the basepairing energy changing 
between e = and e = —ksT/lp. For the both models, we 
put a = 0.01. 



is due to the competition of conformational entropy and 
elastic free energy, and it is expected to occur even in the 
regime when basepairing is energetically unfavorable. 

In the light of our results, one can see a clear rela- 
tionship between the three types of the force-induced de- 
naturation: (i) unzipping of dsDNA[13]-[14], (ii) denat- 
uration of RNA with a preferred secondary structure[6], 
and (iii) stretching a self-complimentary or random ss- 
DNA. While in the case of dsDNA the unzipping occurs 
as a first order transition, it becomes a crossover for the 
two other cases. The width of the crossover is defined 
by the typical length of a single hybridized region. We 
expect this width to be sensitive to the sequence of ss- 
DNA/RNA. In fact, the force-induced denaturation of 
RNA [6] was predicted to show a sequence of unzipping 
steps. It follows from our discussion that the sharpness 
of those steps may be even more pronounced than was 
originally predicted within FJC-based model. In the case 
of ssDNA, the sequence disorder must result in smearing 
of the unzipping crossover. Because of its cntropic na- 
ture, the unfolding transition is expected only in the case 
of ssDNA (or long RNA, in molten phase [11]). 

At present, the experimental indications of the second 
order unfolding transition arc not conclusive enough [3]- 
[4]. On the other hand, experiments with uniform self- 
complimentary DNA show a clear manifestation of the 
sharp unzipping crossover [5]. However, their precision is 
still insufhcient to make a quantitative comparison with 
the theory, and to distinguish between different underly- 
ing elastic models. Based on our theory, one may extract 
this information by performing a systematic experimen- 
tal study of the unzipping behavior for various values of 
hybridization energy (e.g. various temperatures). As we 
have shown, the width of the crossover is very sensitive 
to the energy cost of the hairpin loop. 



as well as to the mechanical response of non-random 
RNA molecules[6]. On the other hand, we have interpret 
the unfolding as a topological transition. At the critical 
point, the typical size of independently folded domains 
diverges (in the thermodynamic limit). This transition 
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